Changed a file from non-ASCII to ASCII.



CRF Processing Date:
Edited by: 
Verified by: 



Date:






Changed the margins in cases where the sequence text was "wrapped" down to the next line. 
Edited a format error in the Current Application Data section, specifically:



; (STIC staff) 




Edited the Current Application Data section with the actual current number. The number inputted by the
applicant was □ the prior application data; or □ other



Added the mandatory heading and subheadings for "Current Application Data". 

Edited the "Number of Sequences" field. The applicant spelled out a number instead of using an integer. 

Changed the spelling of a mandatory field (the headings or subheadings), specifically:

6- W./^V/5;^" 

Corrected the SEQ ID NO when obviously incorrect. The sequence numbers that were edited were: 



Inserted or corrected a nucleic number at the end of a nucleic line. SEQ ID NO's edited:



Corrected subheading placement. All responses must be on the same line as each subheading. If the
applicant placed a response below the subheading, this was moved to its appropriate place.

Inserted colons after headings/subheadings. Headings edited included: 



Deleted extra, invalid, headings used by an applicant, specifically: 



Deleted: □ non-ASCII "garbage" at the beginning/end of files; □ secretary initials/filename at end of file;
□ page numbers throughout text; □ other invalid text, such as - .

Inserted mandatory headings, specifically: 



Corrected an obvious error in the response, specifically: 



Edited identifiers where upper case is used but lower case is required, or vice versa. 
Corrected an error in the Number of Sequences field, specifically: 

A "Hard Page Break" code was inserted by the applicant. All occurrences had to be deleted. 

Deleted ending stop codon in amino acid sequences and adjusted the "(A)Length:" field accordingly (error 
due to a PatentIn bug). Sequences corrected:



*Examiner: The above corrections must be communicated to the applicant in the first Office
Action. DO NOT send a copy of this form. 3/^^95
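Several of the edits listed above adjust a LENGTH field so that it agrees with the residues actually present. A minimal Python sketch of such a consistency check follows. It assumes the sequence is laid out as rows of three-letter residue codes interleaved with rows of position numbers; the helper names `count_residues` and `declared_length` are hypothetical and do not come from PatentIn or any USPTO tool.

```python
import re


def count_residues(rows):
    """Count three-letter residue codes, ignoring rows of position numbers."""
    return sum(1 for row in rows for tok in row.split() if tok.isalpha())


def declared_length(field):
    """Extract the integer from a field such as '(A) LENGTH: 54 amino acids'."""
    match = re.search(r"LENGTH:\s*(\d+)", field)
    return int(match.group(1)) if match else None


# Sample rows typed in by hand for illustration (layout is an assumption).
rows = [
    "Asp Gly Val Lys Leu Cys Asp Val Pro Ser Gly Thr Trp Ser Gly His",
    "1 5 10 15",
    "Cys Gly Ser Ser Ser Lys Cys Ser Gln Gln Cys Lys Asp Arg Glu His",
    "20 25 30",
    "Phe Ala Tyr Gly Gly Ala Cys His Tyr Gln Phe Pro Ser Val Lys Cys",
    "35 40 45",
    "Phe Cys Lys Arg Gln Cys",
    "50",
]

field = "(A) LENGTH: 54 amino acids"
matches = count_residues(rows) == declared_length(field)
print("LENGTH field agrees with residue count:", matches)
```

A mismatch would indicate either an extra trailing entry in the sequence or a LENGTH value that needs adjusting; the sketch only detects the disagreement and does not decide which side is wrong.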



Application No.:



NOTICE TO COMPLY WITH REQUIREMENTS FOR PATENT APPLICATIONS CONTAINING 
NUCLEOTIDE SEQUENCE AND/OR AMINO ACID SEQUENCE DISCLOSURES 

The nucleotide and/or amino acid sequence disclosure contained in this application does not
comply with the requirements for such a disclosure as set forth in 37 C.F.R. 1.821 - 1.825 for
the following reason(s):

1. This application clearly fails to comply with the requirements of 37 C.F.R. 1.821-1.825. Applicant's
attention is directed to these regulations, published at 1114 OG 29, May 15, 1990 and at 55 FR
18230, May 1, 1990.



2. This application does not contain, as a separate part of the disclosure on paper copy, a 'Sequence
Listing' as required by 37 C.F.R. 1.821(c).




3. A copy of the 'Sequence Listing' in computer readable form has not been submitted as required by
37 C.F.R. 1.821(e).






4. A copy of the 'Sequence Listing' in computer readable form has been submitted. However, the
content of the computer readable form does not comply with the requirements of 37 C.F.R. 1.822
and/or 1.823, as indicated on the attached copy of the marked-up 'Raw Sequence Listing.'

5. The computer readable form that has been filed with this application has been found to be damaged
and/or unreadable as indicated on the attached CRF Diskette Problem Report. A Substitute
computer readable form must be submitted as required by 37 C.F.R. 1.825(d).

6. The paper copy of the 'Sequence Listing' is not the same as the computer readable form of the
'Sequence Listing' as required by 37 C.F.R. 1.821(e).

7. Other:



Applicant Must Provide: 

An initial or substitute computer readable form (CRF) copy of the 'Sequence Listing'.

An initial or substitute paper copy of the 'Sequence Listing', as well as an amendment directing its
entry into the specification.

A statement that the content of the paper and computer readable copies are the same and, where
applicable, include no new matter, as required by 37 C.F.R. 1.821(e) or 1.821(f) or 1.821(g) or
1.825(b) or 1.825(d).

For questions regarding compliance to these requirements, please contact 

For Rules interpretation, call (703) 308-4216 
For CRF Submission Help, call (703) 308-4212 
For PatentIn software help, call (703) 308-6856



PLEASE RETURN A COPY OF THIS NOTICE WITH YOUR RESPONSE 
RAW SEQUENCE LISTING

PATENT APPLICATION US/08/842,898 



DATE: 01/14/98 
TIME: 16:39:38 



INPUT SET: S22556.raw



This Raw Listing contains the General
Information Section and up to the first 5 pages.



SEQUENCE LISTING 





(1) General Information:



(i) APPLICANT: BROEKAERT, WILLEM F.
CAMMUE, BRUNO P. A.
OSBORN, RUPERT W.
REES, SARAH B.



(ii) TITLE OF INVENTION: ANTIMICROBIAL PROTEINS 
(iii) NUMBER OF SEQUENCES: 13

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: PILLSBURY MADISON & SUTRO LLP 

(B) STREET: 1100 New York Avenue, N.W.

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: U.S.A.

(F) ZIP: 20005-3918 

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Microsoft Word 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US/08/842,898

(B) FILING DATE: 22-OCT-1996 

(C) CLASSIFICATION: 536 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/656,318 

(B) FILING DATE: 12-JUN-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/GB94/02766 

(B) FILING DATE: 19-DEC-1994 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 9326424.0 

(B) FILING DATE: 24-DEC-1993 



2) INFORMATION FOR SEQ ID NO: 1: 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Hs-AFP1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Asp Gly Val Lys Leu Cys Asp Val Pro Ser Gly Thr Trp Ser Gly His
1 5 10 15

Cys Gly Ser Ser Ser Lys Cys Ser Gln Gln Cys Lys Asp Arg Glu His
20 25 30

Phe Ala Tyr Gly Gly Ala Cys His Tyr Gln Phe Pro Ser Val Lys Cys
35 40 45

Phe Cys Lys Arg Gln Cys
50



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Ah-AMP1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Leu Cys Asn Glu Arg Pro Ser Gln Thr Trp Ser Gly Asn Cys Gly Asn
1 5 10 15

Thr Ala His Cys Asp Lys Gln Cys Gln Asp Trp Glu Lys Ala Ser His
20 25 30

Gly Ala Cys His Lys Arg Glu Asn His Trp Lys Cys Phe Cys Tyr Phe
35 40 45

Asn Cys
50




100 
101 

102 (2) INFORMATION FOR SEQ ID NO: 3: 

103 (i) SEQUENCE CHARACTERISTICS: 

104 (A) LENGTH: 51 amino acids 

105 (B) TYPE: amino acid 

106 (C) STRANDEDNESS: single

107 (D) TOPOLOGY: linear 
108 

109 (ii) MOLECULE TYPE: protein 
110 

111 (vi) ORIGINAL SOURCE: 

112 (A) ORGANISM: Rs-AFP1
113 

114 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
115 

116 Glx Lys Leu Cys Glu Arg Pro Ser Gly Thr Trp Ser Gly Val Cys Gly 

117 1 5 10 15 
118 

119 Asn Asn Asn Ala Cys Lys Asn Gln Cys Ile Asn Leu Glu Lys Ala Arg

120 20 25 30
121

122 His Gly Ser Cys Asn Tyr Val Phe Pro Ala His Lys Cys Ile Cys Tyr

123 35 40 45 
124 

125 Phe Pro Cys 

126 50 
127 

128 

129 (2) INFORMATION FOR SEQ ID NO: 4: 
130 

131 (i) SEQUENCE CHARACTERISTICS: 

132 (A) LENGTH: 51 amino acids 

133 (B) TYPE: amino acid 

134 (C) STRANDEDNESS: single 

135 (D) TOPOLOGY: linear 
136 

137 (ii) MOLECULE TYPE: protein 
138 

139 (vi) ORIGINAL SOURCE: 

140 (A) ORGANISM: RS-AFP2 
141 

142 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

143 

144 Glx Lys Leu Cys Gln Arg Pro Ser Gly Thr Trp Ser Gly Val Cys Gly

145 1 5 10 15
146

147 Asn Asn Asn Ala Cys Lys Asn Gln Cys Ile Arg Leu Glu Lys Ala Arg

148 20 25 30
149

150 His Gly Ser Cys Asn Tyr Val Phe Pro Ala His Lys Cys Ile Cys Tyr

151 35 40 45 
152 




153 Phe Pro Cys 

154 50 
155 

156 

157 (2) INFORMATION FOR SEQ ID NO: 5: 
158 

159 (i) SEQUENCE CHARACTERISTICS: 

160 (A) LENGTH: 50 amino acids 

161 (B) TYPE: amino acid 

162 (C) STRANDEDNESS: single

163 (D) TOPOLOGY: linear 
164 

165 (ii) MOLECULE TYPE: protein 
166 

167 (vi) ORIGINAL SOURCE: 

168 (A) ORGANISM: Dm-AMP1
169 

170 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
171 

172 Glu Leu Cys Glu Lys Ala Ser Lys Thr Trp Ser Gly Asn Cys Gly Asn 

173 1 5 10 15
174

175 Thr Gly His Cys Asp Asn Gln Cys Lys Ser Trp Glu Gly Ala Ala His

176 20 25 30 
177 

178 Gly Ala Cys His Val Arg Asn Gly Lys His Met Cys Phe Cys Tyr Phe 

179 35 40 45 
180 

181 Asn Cys 

182 50 
183 

184 

185 (2) INFORMATION FOR SEQ ID NO: 6: 
186 

187 (i) SEQUENCE CHARACTERISTICS: 

188 (A) LENGTH: 50 amino acids 

189 (B) TYPE: amino acid 

190 (C) STRANDEDNESS: single 

191 (D) TOPOLOGY: linear 
192 

193 (ii) MOLECULE TYPE: protein 
194 

195 (vi) ORIGINAL SOURCE: 

196 (A) ORGANISM: Cb-AMP1
197 

198 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
199 

200 Glu Leu Cys Glu Lys Ala Ser Lys Thr Trp Ser Gly Asn Cys Gly Asn 

201 1 5 10 15
202

203 Thr Lys His Cys Asp Asp Gln Cys Lys Ser Trp Glu Gly Ala Ala His

204 20 25 30 
205 



Gly Ala Cys His Val Arg Asn Gly Lys His Met Cys Phe Cys Tyr Phe 
35 40 45 



Asn Cys 
50 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Cb-AMP1

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Asn Leu Cys Glu Arg Ala Ser Leu Thr Trp Thr Gly Asn Cys Gly Asn 
1 5 10 15

Thr Gly His Cys Asp Thr Gln Cys Arg Asn Trp Glu Ser Ala Lys His
20 25 30 

Gly Ala Cys His Lys Arg Gly Asn Trp Lys Cys Phe Cys Tyr Phe Asp 
35 40 45 

Cys 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Lc-AFP 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Lys Thr Cys Glu Asn Leu Ser Gly Thr Phe Lys Gly Pro Cys Ile Pro
1 5 10 15






****** PREVIOUSLY ERRORED SEQUENCES - EDITED ***** 



102 (2) INFORMATION FOR SEQ ID NO: 3: 

103 (i) SEQUENCE CHARACTERISTICS: 

104 (A) LENGTH: 51 amino acids 

105 (B) TYPE: amino acid 

106 (C) STRANDEDNESS: single 

107 (D) TOPOLOGY: linear 
108 

109 (ii) MOLECULE TYPE: protein 
110 

111 (vi) ORIGINAL SOURCE: 

112 (A) ORGANISM: Rs-AFP1
113 

114 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
115 

116 Glx Lys Leu Cys Glu Arg Pro Ser Gly Thr Trp Ser Gly Val Cys Gly

117 1 5 10 15
118

119 Asn Asn Asn Ala Cys Lys Asn Gln Cys Ile Asn Leu Glu Lys Ala Arg

120 20 25 30
121

122 His Gly Ser Cys Asn Tyr Val Phe Pro Ala His Lys Cys Ile Cys Tyr

123 35 40 45 
124 

125 Phe Pro Cys 

126 50 
127 

128 

129 (2) INFORMATION FOR SEQ ID NO: 4: 
130 

131 (i) SEQUENCE CHARACTERISTICS: 

132 (A) LENGTH: 51 amino acids 

133 (B) TYPE: amino acid 

134 (C) STRANDEDNESS: single 

135 (D) TOPOLOGY: linear 
136 

137 (ii) MOLECULE TYPE: protein 
138 

139 (vi) ORIGINAL SOURCE: 

140 (A) ORGANISM: RS-AFP2 
141 

142 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

143 

144 Glx Lys Leu Cys Gln Arg Pro Ser Gly Thr Trp Ser Gly Val Cys Gly

145 1 5 10 15
146

147 Asn Asn Asn Ala Cys Lys Asn Gln Cys Ile Arg Leu Glu Lys Ala Arg




148 20 25 30 

149 

150 His Gly Ser Cys Asn Tyr Val Phe Pro Ala His Lys Cys Ile Cys Tyr

151 35 40 45 
152 

153 Phe Pro Cys
154 50 
155 
156 



157 (2) INFORMATION FOR SEQ ID NO: 5: 
158 

159 (i) SEQUENCE CHARACTERISTICS: 

160 (A) LENGTH: 50 amino acids 

161 (B) TYPE: amino acid 

162 (C) STRANDEDNESS: single

163 (D) TOPOLOGY: linear 
164 

165 (ii) MOLECULE TYPE: protein 
166 

167 (vi) ORIGINAL SOURCE: 

168 (A) ORGANISM: Dm-AMP1
169 

170 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
171 

172 Glu Leu Cys Glu Lys Ala Ser Lys Thr Trp Ser Gly Asn Cys Gly Asn 

173 1 5 10 15
174

175 Thr Gly His Cys Asp Asn Gln Cys Lys Ser Trp Glu Gly Ala Ala His

176 20 25 30 
177 

178 Gly Ala Cys His Val Arg Asn Gly Lys His Met Cys Phe Cys Tyr Phe 

179 35 40 45 
180 

181 Asn Cys 

182 50 
183 

184 



185 (2) INFORMATION FOR SEQ ID NO: 6:
186

187 (i) SEQUENCE CHARACTERISTICS:

188 (A) LENGTH: 50 amino acids

189 (B) TYPE: amino acid

190 (C) STRANDEDNESS: single

191 (D) TOPOLOGY: linear
192

193 (ii) MOLECULE TYPE: protein
194

195 (vi) ORIGINAL SOURCE:

196 (A) ORGANISM: Cb-AMP1
197

198 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:




199 

200 Glu Leu Cys Glu Lys Ala Ser Lys Thr Trp Ser Gly Asn Cys Gly Asn 

201 1 5 10 15
202

203 Thr Lys His Cys Asp Asp Gln Cys Lys Ser Trp Glu Gly Ala Ala His

204 20 25 30 
205 

206 Gly Ala Cys His Val Arg Asn Gly Lys His Met Cys Phe Cys Tyr Phe 

207 35 40 45 
208 

209 

210 Asn Cys 

211 50 
212 

213 



366 (2) INFORMATION FOR SEQ ID NO: 13: 

367 (i) SEQUENCE CHARACTERISTICS: 

368 (A) LENGTH: 47 amino acids 

369 (B) TYPE: amino acid 

370 (C) STRANDEDNESS: single

371 (D) TOPOLOGY: linear 
372 

373 (ii) MOLECULE TYPE: protein 
374 

375 (vi) ORIGINAL SOURCE: 

376 (A) ORGANISM: p322 
377 

378 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
379 

380 Arg His Cys Glu Ser Leu Ser His Arg Phe Lys Gly Pro Cys Thr Arg 

381 1 5 10 15 
382 

383 Asp Ser Asn Cys Ala Ser Val Cys Glu Thr Glu Arg Phe Ser Gly Gly 

384 20 25 30 
385 

386 Asn Cys His Gly Phe Arg Arg Arg Cys Phe Cys Thr Lys Pro Cys 

387 35 40 45 
388 



SEQUENCE VERIFICATION REPORT 

PATENT APPLICATION US/08/842,898 



DATE: 01/14/98 
TIME: 16:40:04 



INPUT SET: S22556.raw



Line Error Original Text



PAGE: 1 
RAW SEQUENCE LISTING

PATENT APPLICATION US/08/842,898 



DATE: 01/13/98 
TIME: 15:14:28 



INPUT SET: S22556.raw



This Raw Listing contains the General 
Information Section and those Sequences 
containing ERRORS. 



1 SEQUENCE LISTING 

3 (1) General Information:

5 (i) APPLICANT: BROEKAERT, WILLEM F.

6 CAMMUE, BRUNO P. A.

7 OSBORN, RUPERT W.

8 REES, SARAH B.
9 

10 (ii) TITLE OF INVENTION: ANTIMICROBIAL PROTEINS 
11 

--> 12 (iii) NUMBER OF SEQUENCES: 13

13 

14 (iv) CORRESPONDENCE ADDRESS: 

15 (A) ADDRESSEE: PILLSBURY MADISON & SUTRO LLP 

16 (B) STREET: 1100 New York Avenue, N.W. 

17 (C) CITY: Washington 

18 (D) STATE: D.C. 

19 (E) COUNTRY: U.S.A.

20 (F) ZIP: 20005-3918 
21 

22 (v) COMPUTER READABLE FORM:

23 (A) MEDIUM TYPE: Floppy disk

24 (B) COMPUTER: IBM PC compatible 

25 (C) OPERATING SYSTEM: PC-DOS/MS-DOS

26 (D) SOFTWARE: Microsoft Word 
27 

28 (vi) CURRENT APPLICATION DATA: 

29 (A) APPLICATION NUMBER: 

30 (B) FILING DATE: 22-OCT-1996 

31 (C) CLASSIFICATION: 
32 

33 (vii) PRIOR APPLICATION DATA: 

34 (A) APPLICATION NUMBER: US 08/656,318 

35 (B) FILING DATE: 12-JUN-1996 
36 

37 (vii) PRIOR APPLICATION DATA:

38 (A) APPLICATION NUMBER: PCT/GB94/02766 

39 (B) FILING DATE: 19-DEC-1994 
40 

41 (vii) PRIOR APPLICATION DATA: 

42 (A) APPLICATION NUMBER: GB 9326424.0 

43 (B) FILING DATE: 24-DEC-1993 
44 

45 



46

47 2) INFORMATION FOR SEQ ID NO: 1:

--> 48 (i) SEQUENCE CHARACTERISTICS:

--> 49 (A) LENGTH: 54 amino acids

--> 50 (B) TYPE: amino acid

--> 51 (C) STRANDEDNESS: single

--> 52 (D) TOPOLOGY: linear
53

--> 54 (ii) MOLECULE TYPE: protein
55

--> 56 (vi) ORIGINAL SOURCE:

--> 57 (A) ORGANISM: Hs-AFP1
58

--> 59 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
60

61 Asp Gly Val Lys Leu Cys Asp Val Pro Ser Gly Thr Trp Ser Gly His

62 1 5 10 15
63

64 Cys Gly Ser Ser Ser Lys Cys Ser Gln Gln Cys Lys Asp Arg Glu His

65 20 25 30
66

67 Phe Ala Tyr Gly Gly Ala Cys His Tyr Gln Phe Pro Ser Val Lys Cys

68 35 40 45
69

70 Phe Cys Lys Arg Gln Cys

71 50
72

73



ERRORED SEQUENCES FOLLOW: 



102 (2) INFORMATION FOR SEQ ID NO: 3: 

103 (i) SEQUENCE CHARACTERISTICS: 

104 (A) LENGTH: 51 amino acids 

105 (B) TYPE: amino acid 

106 (C) STRANDEDNESS: single 

107 (D) TOPOLOGY: linear 
108 

109 (ii) MOLECULE TYPE: protein 
110 

111 (vi) ORIGINAL SOURCE: 

112 (A) ORGANISM: Rs-AFP1
113 

114 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
115 

116 Glx Lys Leu Cys Glu Arg Pro Ser Gly Thr Trp Ser Gly Val Cys Gly

--> 117 1 5 10 15
118

119 Asn Asn Asn Ala Cys Lys Asn Gln Cys Ile Asn Leu Glu Lys Ala Arg

120 20 25 30




121 

122 His Gly Ser Cys Asn Tyr Val Phe Pro Ala His Lys Cys Ile Cys Tyr

123 35 40 45 
124 

125 Phe Pro Cys 

126 50 
127 

128 



129 (2) INFORMATION FOR SEQ ID NO: 4: 
130

131 (i) SEQUENCE CHARACTERISTICS: 

132 (A) LENGTH: 51 amino acids 

133 (B) TYPE: amino acid 

134 (C) STRANDEDNESS: single

135 (D) TOPOLOGY: linear 
136 

137 (ii) MOLECULE TYPE: protein 
138 

139 (vi) ORIGINAL SOURCE: 

140 (A) ORGANISM: RS-AFP2 
141 

142 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

143 

144 Glx Lys Leu Cys Gln Arg Pro Ser Gly Thr Trp Ser Gly Val Cys Gly

--> 145 1 5 10 15
146

147 Asn Asn Asn Ala Cys Lys Asn Gln Cys Ile Arg Leu Glu Lys Ala Arg

148 20 25 30
149

150 His Gly Ser Cys Asn Tyr Val Phe Pro Ala His Lys Cys Ile Cys Tyr

151 35 40 45 
152 

153 Phe Pro Cys 

154 50 
155 

156 



157 (2) INFORMATION FOR SEQ ID NO: 5: 
158 

159 (i) SEQUENCE CHARACTERISTICS:

160 (A) LENGTH: 50 amino acids 

161 (B) TYPE: amino acid 

162 (C) STRANDEDNESS: single 

163 (D) TOPOLOGY: linear 
164 

165 (ii) MOLECULE TYPE: protein 
166 

167 (vi) ORIGINAL SOURCE: 

168 (A) ORGANISM: Dm-AMP1
169 

170 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
171 



172 Glu Leu Cys Glu Lys Ala Ser Lys Thr Trp Ser Gly Asn Cys Gly Asn

--> 173 1 5 10 15
174

175 Thr Gly His Cys Asp Asn Gln Cys Lys Ser Trp Glu Gly Ala Ala His

176 20 25 30 
177 

178 Gly Ala Cys His Val Arg Asn Gly Lys His Met Cys Phe Cys Tyr Phe 

179 35 40 45 
180 

181 Asn Cys 

182 50 
183 

184 

185 (2) INFORMATION FOR SEQ ID NO: 6: 
186 

187 (i) SEQUENCE CHARACTERISTICS: 

188 (A) LENGTH: 50 amino acids 

189 (B) TYPE: amino acid 

190 (C) STRANDEDNESS: single

191 (D) TOPOLOGY: linear 
192 

193 (ii) MOLECULE TYPE: protein 
194 

195 (vi) ORIGINAL SOURCE: 
--> 196 (A) ORGA^^SM: Cb-AMP1

197

198 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
199 

200 Glu Leu Cys Glu Lys Ala Ser Lys Thr Trp Ser Gly Asn Cys Gly Asn 

201 1 5 10 15
202

203 Thr Lys His Cys Asp Asp Gln Cys Lys Ser Trp Glu Gly Ala Ala His

204 20 25 30 
205 

206 Gly Ala Cys His Val Arg Asn Gly Lys His Met Cys Phe Cys Tyr Phe 

207 35 40 45 
208 

209 

210 Asn Cys 

211 50 
212 

213 

366 (2) INFORMATION FOR SEQ ID NO: 13: 

367 (i) SEQUENCE CHARACTERISTICS: 

368 (A) LENGTH: 47 amino acids 

369 (B) TYPE: amino acid
370 (C) STRANDEDNESS: single 
371 (D) TOPOLOGY: linear 
372 

373 (ii) MOLECULE TYPE: protein 




375 (vi) ORIGINAL SOURCE: 

376 (A) ORGANISM: p322 
377 

378 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
379 

380 Arg His Cys Glu Ser Leu Ser His Arg Phe Lys Gly Pro Cys Thr Arg 

381 1 5 10 15
382 

383 Asp Ser Asn Cys Ala Ser Val Cys Glu Thr Glu Arg Phe Ser Gly Gly 

384 20 25 30 
385 

386 Asn Cys His Gly Phe Arg Arg Arg Cys Phe Cys Thr Lys Pro Cys 

387 35 40 45 
388 




PAGE: 1 SEQUENCE VERIFICATION REPORT DATE: 01/13/98 

PATENT APPLICATION US/08/842,898 TIME: 15:14:44 

INPUT SET: S22556.raw



Line  Error                                                     Original Text

12    Number of Sequences (13) Doesn't Equal Actual Count (12)  (iii) NUMBER OF SEQUENCES: 13
48    Unknown or Misplaced Identifier                           (i) SEQUENCE CHARACTERISTICS:
49    Unknown or Misplaced Identifier                           (A) LENGTH: 54 amino acids
50    Unknown or Misplaced Identifier                           (B) TYPE: amino acid
51    Unknown or Misplaced Identifier                           (C) STRANDEDNESS: single
52    Unknown or Misplaced Identifier                           (D) TOPOLOGY: linear
54    Unknown or Misplaced Identifier                           (ii) MOLECULE TYPE: protein
56    Unknown or Misplaced Identifier                           (vi) ORIGINAL SOURCE:
57    Unknown or Misplaced Identifier                           (A) ORGANISM: Hs-AFP1
59    Unknown or Misplaced Identifier                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
117   Wrong Amino Acid Designator                               1 5 10 15
145   Wrong Amino Acid Designator                               1 5 10 15
173   Wrong Amino Acid Designator                               1 5 10 15
196   Unknown or Misplaced Identifier                           (A) ORGAh-ISM: Cb-AMP1
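The first row of the report compares the declared sequence count with the number of sequence headers actually recognized. The Python sketch below shows one way such a comparison might work. It assumes a header is counted only when it begins with "(2)" including both parentheses; the function name `check_sequence_count` and both regular expressions are illustrative and are not the rules of the USPTO checking software.

```python
import re

# Assumed header shape: "(2) INFORMATION FOR SEQ ID NO: n:" with both parentheses.
HEADER = re.compile(r"^\(2\)\s+INFORMATION FOR SEQ ID NO:\s*(\d+):")
# Assumed shape of the declared-count field in the General Information section.
DECLARED = re.compile(r"\(iii\)\s+NUMBER OF SEQUENCES:\s*(\S+)")


def check_sequence_count(lines):
    """Return (declared, actual) for a listing given as a list of text lines."""
    declared = None
    actual = 0
    for line in lines:
        text = line.strip()
        found = DECLARED.search(text)
        if found and declared is None:
            declared = found.group(1)  # kept as text; it may not be an integer
        if HEADER.match(text):
            actual += 1
    return declared, actual


# Hypothetical three-line listing in which one header lacks its opening parenthesis.
sample = [
    "(iii) NUMBER OF SEQUENCES: 2",
    "2) INFORMATION FOR SEQ ID NO: 1:",
    "(2) INFORMATION FOR SEQ ID NO: 2:",
]

declared, actual = check_sequence_count(sample)
print("declared:", declared, "actual:", actual)
```

Under these assumptions a single malformed header lowers the actual count by one, and the fields that follow it would have no recognized sequence to belong to.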



